Evolving networks by merging cliques 
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We propose a model for evolving networks by merging building blocks represented as complete 
graphs, reminiscent of modules in biological system or communities in sociology. The model shows 
power-law degree distributions, power-law clustering spectra and high average clustering coefficients 
independent of network size. The analytical solutions indicate that a degree exponent is determined 
by the ratio of the number of merging nodes to that of all nodes in the blocks, demonstrating that 
the exponent is tunable, and are also applicable when the blocks are classical networks such as 
Erdos-Renyi or regular graphs. Our model becomes the same model as the Barabasi- Albert model 
under a specific condition. 

PACS numbers: 89.75.Hc, 05.65.+b 



I. INTRODUCTION 

Complex networks, evolved from the Erdos-Renyi (ER) 
random network are powerful models that can sim- 
ply describe complex systems in many fields such as biol- 
ogy, sociology, and ecology, or information infrastructure, 
World-Wide Web, and Internet 0,13, 0,0. In P art i cuiar : 
some striking statistical properties in real- world complex 
networks have been revealed in recent years. The network 
models reproduce the properties and promise to under- 
stand growth and control mechanisms of the systems. 

One of the striking properties in the real-world net- 
works is a scale-free feature: power-law degree distribu- 
tions are defined as existence probability of nodes with 
degree (number of edges) k; P(k) ~ fc~ 7 with 2 < 7 < 3 
are empirically found 0, 0, 0, Ej • The feature can not 
be explained by the ER model because the model shows 
Poisson distribution. However, the Barabasi- Albert (BA) 
model 0,0 exhibits power-law degree distributions. The 
model is well known as a scale-free network model and 
consists of the two mechanisms: growth and preferential 
attachment, 



n, 



(i) 



denotes the probability that node i is chosen to get an 
edge from the new node, and is proportional to degree of 
node i; fcj. Equation (QJ means that high-degree nodes 
get an even better chance to attract next new edges; the 
rich get richer. The model indicates that P(k) ~ fc~ 3 and 
the degree exponent is fixed. After that, extended BA 
models with modified preferential attachments, including 
weight @ or competitive Q dynamics, and/or local rules 
[Ioj| . rewiring, and adding of edges, are proposed to repro- 
duce statistical properties between BA model networks 
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and real- world networks. In addition, exponential-like 
distributions are often observed in real-world networks 
The distributions are reproduced by an extended 
BA model with aging and saturation effects 0] , nonlin- 
ear preferential attachment rule 0] , or controllability of 
growth and preferential attachment 0|. 

The other of the striking properties is a small-world 
feature: significantly high clustering coefficients C de- 
note density of edges between neighbors of a given node 
and imply clique (cluster) structures in the networks |15| . 
The structures correspond to communities in social net- 
works and network motifs [lfij such as feedforward and 
feedback loops in biological and technological networks. 
Emergence of the clique structures in the networks are 
called "transitivity phenomena" fl?j |. 

In recent years, the transitivity in the many networks 
are actively investigated with statistical approaches and 
it is found that power-law clustering spectra are defined 
as correlations between degree k of a given node and the 
clustering coefficient C of the node; C(k) ~ k~~ a with 
a < 1 in rough is found in the numerical analyses [Tsl 
IT^.l20j . More specifically, a exhibits around 1, suggesting 
a hierarchical structure of the cliques 0, 0] . 

In modeling approaches for the structures, the ex- 
tended BA models with aging [2lJ or triad formation 
[22I 0| and Ravasz's hierarchical model have been 
proposed because of the absence of the structure in orig- 
inal BA networks. In particular, the hierarchical model 
evolves determinably with replication of complete graphs 
as cliques, providing a power-law clustering spectrum; 
C(k) ~ k~ 1 and degree distribution with arbitrary de- 
gree exponent. The model takes into account system- 
atic reorganization of cliques as functional modules or 
communities and the consideration is important for un- 
derstanding developmental processes and controls in the 
systems. 

In this paper, we propose that an evolving network 
model with reorganization of cliques is constitutional unit 
(basic building block). The model is inspired by the elim- 
ination of deterministic growing process in Ravasz's hi- 
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erarchical model, providing high general versatility for 
growing mechanisms. Moreover, the model characterizes 
a relationship between statistical properties and compo- 
sitions of the cliques. 

We explain details of the model in Sec. |nl and the an- 
alytical solutions with mean-field continuous approaches 
of the statistical properties in Sec. IIIII the comparisons 
between the numerical and the analytical solutions in Sec. 
FVl and conclude this paper in Sec. |Vj 



II. MODEL 

Here we present an evolving network model that the 
mechanism has the following three procedures (see Fig. 
d: 

i) We start from a clique as a complete graph with 
a(> 2) nodes. 

ii) At every time step the new clique with the same 
size is joined by merging to existing m(< a) node(s). 
Please note that cliques are merged without adding 
extra links. 

iii) When merging the graphs, the preferential attach- 
ment (PA) rule, Eq. is used to select m old 
nodes and resultant duplicated edge(s) between the 
merged nodes are counted and contribute to PA in 
the next time step. [Please imagine that all edges in 
the clique are stretchable, then a node in the clique 
can reach any existing nodes. Any old nodes can be 
targeted by node(s) in the new clique.] 

With the a > m condition, networks grow in time steps. 






t=2 



t=3 



FIG. 1: Schematic diagram of growth process of the model 
network with a = 3 and m = 1. Each clique is merged through 
common node(s) without adding extra edges. 



III. ANALYTICAL SOLUTIONS 



where S(x) is Kronecker's delta function. To describe 
the degree distribution of our model we take the contin- 
uous mean-field approach used by many authors 0, EJ 0] • 
Since a network in our model evolves in every clique, the 
standard approach can not be applied directly. 

We take the following method called the coarse- 
graining approach to be applied to the standard con- 
tinuous mean-field approach. Let the a-size clique be 



Coarse-graining 




FIG. 
4). 



2: Schematic diagram of coarse-graining method (a 



regarded as a grain with (°) edges (see Fig. |2J). Then, 
edges connecting to m merged nodes in the clique can be 
considered as edges join to other grains, and ( a ~ 2 m ) edges 
in the clique are futile or do not link to the other grains. 
That is, the relationship between Gi are the degree of a 
grain and fcj are expressed as 

Gi = ki + k , (3) 

where k corresponds to ( a ~ 2 m ) ■ 

Now the standard approach can be applied; a time 
evolution of the degree of a Gi can be written as 



dGi , . Gi 



(4) 



where Gj = ^(g)^ The solution of the equation with 
Gi(t — s) — (°) as an initial condition for Eq. @ is 



Gift) 



where 



raja 



(5) 



(0) 



A. Degree distribution 



represents the ratio between the number of merged 
node(s) and that of all nodes in the clique. Please note 
that Eq. © also satisfies the case that the clique is a 
regular graph or a random graph Q because the graphs 
have homogeneous degrees as well as complete graphs. 

By using the continuous approach, the probability dis- 
tribution of Gi can be obtained, 



The degree distribution is defined as the existence 
probability of nodes with degree k, and is formulated 

as 



1 N 



(2) 



P(G) = 



P G^ 



Substituting Eq. © into Eq. Q we obtain 
P{k)=A(a,p)(k + K )-^ 



(7) 



(8) 



where A(a,p) — Q) 1 P / p, demonstrating that the distri- 
bution has power-law fashion and cutoff in lower degrees, 
and the exponent is 



1 



7 : 



(9) 



Equation © shows a direct relationship between the 
exponent of the distribution and the ratio shown Eq. 
©. To establish the correspondence of our model to 
the BA model @, one can assign 2 to a and 1 to m; 
we can calculate that the exponent in the condition is 
(0.5 + l)/0.5 = 3, showing that our model can be recog- 
nized as a more expanded model than BA. 




B. Clustering spectrum 

It is well known that complex networks have other sta- 
tistical properties. Next we step into analytical treat- 
ment of a clustering spectrum that can estimate char- 
acteristics of the hierarchy of networks' modularity, the 
clustering spectrum is defined as 



C{k) 



1 



NP(k) 



N 

E 



d x 5(k - h), 



(10) 



where 8 (x) is Kronecker's delta function, and C% denotes 
the clustering coefficient defined by 



Ci 



Mi 

(*) Hk. 



2M, 



1) 



(11) 



and means density of neighboring edges of the node, 
where ki and Mi denote the degree of node i and the 
number of edges between the neighbors, respectively. 

To derive the analytical solution of Eq. <|10(l . we need 
to obtain the formula of Mi first. The following two 
conditions that Mi increases are need to be examined: 

(i) The condition that node i is merged to a node of a 
clique as a complete graph, and the other node(s) 
of the clique are merged to existing nodes [see Fig. 

Bl(i)]. 

(ii) The condition that the new clique is merged to 
node(s) that are neighboring to node i [see Fig. 
(")]■ 



Since both conditions independently contribute to an 
icrease of il 
both effects, 

Mi =MP +Mf i] . (12) 



increase of MP, Mf 5 can be expressed as the sum of 



Because we assume that a clique is a complete graph, 
condition (i) becomes M^ concrete, 



M 



(i) 



(13) 



FIG. 3: Conditions for increasing of Mi (a = 3, m = 2). The 
existing nodes are filled with black, the new nodes are open 
circles, and the merged nodes are filled with gray. The thick 
lines are edges between nearest neighbors of node i. 



M^ increases when neighboring nodes to node i are 
consecutively chosen by the PA rule. In other words, 
is proportional to degrees of the neighboring nodes. 
By using the average nearest- neighbor degree of node i, 



(k n n)i, we can write the rate equation of M, 
continuous approach p3[ . 



(ii) 



dM, 



(ii) 



dt 




with the 



(14) 



To go to further analytical treatment, both analytical 
and numerical results help to simplify Eq. (|14ll . (k nn )i 
can be expressed by using degree correlation |24| k nn {k) 
denoting that correlations between nodes with k degree 
and the nearest-neighbors degree to the nodes. Based 
on detailed analysis (see the Appendix), we can assume 
that k nn (k) is uncorrelated with k, leading to further 
simplification reported by Eguiluz et al. |25j | . They show 
that (k nn ) = (k 2 )/(k) for uncorrelated networks, where 
(k 2 ) and (k) means the average of the square of k and 
that of k, respectively. The average degree (k) in our 
model is 



(k) 



a(a — 1) 



a — m 



(15) 



With ki(t) ~ (2)(i/s) p given with Eqs. © and ©, the 
average of square of k, {k 2 ), is expressed as 



(k 2 y 



i 



(a — m)t 



(16) 



Equation l|lQfl represents that (fc 2 ) depends on t. With 
Eq. (|15|l , we get the approximation of time evolution of 
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(k nn ) is sensitive to p, 



IV. NUMERICAL SOLUTIONS 



4(l-2p) 



= const. (0 < p < 0.5) 



a(a— 1) 



hit 



n(a-l) j.2p-l 
4(2p-l) r 



(p = 0.5) (17) 
(0.5 <p< 1). 



Substituting this into Eq. (|14|) with the initial condition 
Mj M (t = 1) = 0, we obtain following Eq. (JTHJ) that shows 
dependence of M^ 11 ' on time, 



kft~ 2 P (0 < p < 0.5) 
oc { kf In 3 t/f (p = 0.5) (18) 
kft 4 P~ 3 (0.5 < p < 1). 



Finally the analytical solution of clustering spectrum be- 
comes 



C(k)~^+B(a,p,N), 



(19) 



where B(a,p,N) gives positive value (see Fig. 0} and is 
expressed as 

0^Nc 2p (0<p<0.5) 
B(a, P,N)={ 2^21 In 3 7V G /7V G (p = 0.5) (20) 
ig^iVr 3 (0.5<p<l), 

where iVc = N/(a — m). 




A. Degree distribution 

In order to confirm the analytical predictions, we per- 
formed numerical simulations of networks generated by 
using our model described in Sees. |n] and II I II Figure 
(A) and (B) show degree distributions with differ- 
ent numerical conditions. Solid lines come from Eq. JSJl. 
We show excellent agreement with the theoretical predic- 
tions. 




FIG. 5: Degree distributions P(k). Different symbols denote 
different numerical results and solid lines are depicted by us- 
ing Eq. © with N = 50 000. (A) a = 5. (B) a = 10. 



FIG. 4: The dependence of B(a, p, N) on p with N = 50, 000. 
With larger p and/or a uncertainty to the distribution of clus- 
tering coefficient C(k) becomes larger. 

Figure0|is depicted from Eq. (|20[1 . For smaller p < 0.5 
and a, B(a, p, N) take smaller values, yielding distribu- 
tion of clustering coefficient C(k) ~ from Eq. (|19|l . 
For larger p > 0.5 and/or p, B(a,p,N) increases rapidly 
and become prominent. The dependence of £>(a, p, N) 
on p is the reason that Eq. (|14|l allows to include 

the number of duplicated edges between any two nodes, 
meaning that Eq. I|20(l does not provide the quantita- 
tive aspect, but gives the qualitative prospect. Section 
IIV Bl demonstrates good consistency between the analyt- 
ical and numerical approaches. 



B. Clustering spectrum 

Based on our model we obtain the degree-clustering 
coefficient correlations (clustering spectra) shown in Fig. 
[5] For p < 0.5, the power-law (SF) regime is established 
as predicted in Eq. l|19fl . indicating the hierarchical fea- 
ture in generated complex networks. For p > 0.5, we 
obtain a gentle decay of C{k) for larger k. This decay 
can be explained by the tendency of B(a, p, N) as a func- 
tion of p. With increasing p, B(a, p, N) increases because 
of more overlapping clique, leading to the transformation 
of the C(k) tail from rapid to flat. The gentle decay of 
the tail corresponds to less chance of establishment of 
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hierarchical structure with larger p. 
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FIG. 6: Clustering spectra C(k). Different symbols denote 
different numerical conditions for m with fixed N — 50 000. 
(A) a — 5. (B) a = 10. The insets show the relationship 
between p and a, where a is defined by the exponent from 
rational distribution C(k) ~ k~ a 



C. Average clustering coefficient 

In order to demonstrate that our model can construct 
a complex network with a high clustering coefficient with 
comparing the BA model, we numerically obtain average- 
clustering coefficients defined as C{N) = (1/iV) X)i=i 
Figured shows both results from our model and the BA 
model and C(N) from the BA model was predicted as 
C(N) oc (\nN) 2 /N ~ N-°- 75 (HH^. In contrast, our 
model exhibits the independence of C(N) on N as well 
as higher C(N) with different p. The feature has been 
reported to be found in the real- world networks 0, 
and is prominent property of hierarchical, small- world 
networks. The inset of Fig. [7] shows the decay of C as 
a function of p. As p increases C gently decreases, due 
to the increase of randomness in the network caused by 
more frequent overlapping. 



io- 
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10"' 
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FIG. 7: Comparison of average clustering coefficients C(N) 
from two models. N varies from 100 to 50 000 with fixed 
a = 6. Inset: Dependence of C on p with N = 3 000 and 
a = 6. 



CONCLUSION 



We have proposed the growth network model with the 
merging clique mechanism. The numerical simulations of 
the model have reproduced the statistical properties ob- 
served in the real-world network; power-law degree dis- 
tributions with arbitrary exponent, power-law clustering 
spectrum, and average clustering coefficients are inde- 
pendent of network size. 

In particular, we also have derived the analytical so- 
lution of the exponent following 7 = (p + l)/p by us- 
ing a continuous approach via coarse-graining procedure. 
The solution showed that the degree exponents are deter- 
mined by the only ratio of the number of merging nodes 
to that of clique nodes and had excellent agreement with 
corresponding numerical simulations. 

This relationship for 7 means that the degree exponent 
is controllable by tuning p and implies that the real- world 
networks with decaying degree exponent tend to contain 
a large number of similar modules or communities with 
higher density as well as we may also be able to predict 
a degree exponent 7 when we can estimate the ratio. 

In addition, our research expects that large-scale com- 
plex networks are consist of small scale classical networks, 
which have been called Erdos-Rcnyi or regular graphs, 
suggesting that the classical graph theory is helpful for 
a making and analyzing the growing network model. We 
hope that our model may become a bridging model be- 
tween scale-free networks and classical networks. 

Finally, because of successful reproduction of some re- 
markable characteristics that can be found in biological 
systems, our approach may become a useful tool to pro- 
vide comprehensive aspects of and to disentangle evo- 
lutionary processes of self-organized biological networks 
and biocomplexity. 
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APPENDIX A: DEGREE CORRELATION 

To show proof of the assumption for the uncorrelated 
average nearest-neighbors degree in Sec. IIII Bl we give 
analytical and numerical solutions for degree correlation 
of the model. 

First, we introduce the analytical solutions. Degree 
correlation the represents the average degree of neighbors 
of node(s) with degree k and is defined as 



k' 



k'P(k'\k), 



(Al) 



where the conditional probability P{k'\k) is the frequency 
that a node with degree k connects to a node with degree 
k'. Using Kronecker's delta function, we redefined the 
degree correlation as 



^2ti=\{knri)i X 5{ki fc) 



k n n{k) 



k) 



(A2) 



where (k nn )i denotes the average nearest-neighbor de- 
gree, written as 



(kn 



Rj 

k; 



(A3) 



In Eq. (|A3|) . Ri{t) denotes the sum of the degree of 
neighbors of node i and are represented as 



Ri 



hev(i) 



kh, 



(A4) 



where V(i) correspond to the set of neighbors of node i. 

To get the analytical solution for the time evolution of 
Ri{t) by means of the the rate equation approach [27j . 
we clarify the two conditions that contribute to increase 
of rates in the equation. The conditions are 

i) Node i is selected by the preferential attachment 
(PA) rule. 

ii) Neighbor(s) of node i are chosen by the PA rule. 

Therefore summation of these contributions can be ex- 
pressed as 



dRjjt) 

at 



mIT[(a - m)(a - 1) + (m - l)K,(t)] 

mll h {a-l). (A5) 



E 

/iev(i) 



The first and the second term are derive from condition 
i) and condition ii), respectively. In Eq. (|A5(I . the K,{t) 
expectation values of the degree of a node with the PA 
rule, written as Eq. (|A6|> . are approximated as Eq. (|A7|) 
for the solution 



ic(t) = ^k h n h 



h=l 



a(a — l)i 



dx. 



(A6) 
(A7) 



The degree depends on time t as indicated, 

p 



h{t) = 

derived in Sec. IIII Al Then we get 



(A8) 



= const (0 < p < 0.5) 
"hit (p = 0.5) 

(0.5 < p < 1) 



JC(t) 



a{a—l) 



o(o-l) ±2p-l 
4(2p-l) 1 



(A9) 



which proves the behaviors the depend on p. Further- 
more, the initial condition at time s is the sum of degrees 
of the others of node i in the clique and is given by 



Ri(s) — (a — m — l)(a — 1) + m/C(s). 



(A10) 



Then, we show the solutions as a function of p as fol- 
lows. 

(i) < p < 0.5. In the range, inserting Eq. <| A9|) into 
Eq. (|A5|) . the sum Ri(t) is described as 



Ri(t) 



tuAq ( a 



a(a - 1) \2 



si s 



(All) 



where A = (a - m - l)(a - 1) + m[a(a - 1)/4(1 - 2p)}. 
Substituting this into Eq. (|A3p . the dominant behavior 
of the degree correlation is given by 



k n n \k) 



■In- 



2k 



a — 1 a(a — 1) 



(A12) 



and represents weak assortativity, defined as positive 
correlation between degree k and the average nearest- 
neighbor degree of the node with degree k. Moreover, the 
tendency of the assortativity is determined by a and/or 
P- 

(ii) p = 0.5. In this case, inserting Eq. 1|A9(I into Eq. 
r5|) , the dominant structure of the sum i?j (t) is 



Ri(t) 



m(m — 1) 



In ( - J In* 



a(a-l) (t\\ 



(A13) 



The same step can be applied as shown in case (i), the 
dominant factors for the correlation function become 



knn {k) 



a(a - 2) 
i2 



hi 



2k 



a(a — 1) 



ln7V G , (A14) 



where Nq — N/ (a — m) and also has weak assortativity 
as seen in case (i) when a > 2. In the case of a = 2, 
the model is the same as the BA model, then the degree 
correlation is described asLjfe) = (lniV)/2 in common 
with the reported results (23, |23| ■ 

(iii) 0.5 < p < 1. In this case, inserting Eq. (|A9|) into 
Eq. (|A5|) . the sum Ri(t) is 
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10 1 10 2 10 1 10 2 10 3 10 1 10 2 10 3 

k k k 



FIG. 8: Degree correlations k nn (k) with N = 50 000. (A) < p < 0.5. (B) p = 0.5. (C) 0.5 < p < 1. Symbols correspond to 
the numerical results and the solid lines are given by Eqs. (IA12JI . l|A14(l . and l|A16fl . respectively. Note that (A) and (B) are 
depicted by using single logarithmic plot, and (C) is a double logarithmic plot. 



to (to - 1) a 
- 77^77 — 772 



t 2 P - 



s p -H p 



a(a — 1) . i . 

to— )■ iTS p -H p . 

4(2,0-1) 



4(2p- l) 2 V2 y 

To obtain the dominant factors of the correlation function we take the same procedure as described above, 



to(to — 1) 



4(2/9-1)2 

indicates the uncorrelated feature for the large k. 



1 



TO + 1 — 4/9 



to — 1 ( a(a — 1) 



2/v 



1/P-2' 



(A15) 



(A16) 



Second, we give numerical solutions of the correlation 
of the model. For p < 0.5 [see Fig. El (A) and El (B)], 
average nearest-neighbor degrees of nodes with degree k 
grow logarithmically with increasing k, and the uncor- 
related features are shown for small a and/or p. The 
solid lines correspond to the analytical results described 
above, and are fitted to the numerical ones for large k. 



i 

For p > 0.5 [see Fig. E1(C)], the average degrees have 
no correlations with respect to degree k. As in the pre- 
vious case, the analytical results are represented as the 
solid lines are fitted to numerical ones. On the basis of 
these analytical and numerical evidences, we assume the 
degree correlation is uncorrelated in Sec. IIII Bl 
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